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Abstract: Recent works have related the bulk first law of black hole mechanics to the first 
law of entanglement in a dual CFT. These are first order relations, and receive corrections 
for finite changes. In particular, the latter is naively expected to be accurate only for small 
changes in the quantum state. But when Newton’s constant is small relative to the AdS 
scale, the former holds to good approximation even for classical perturbations that contain 
many quanta. This suggests that - for appropriate states - corrections to the first law of 
entanglement are suppressed by powers of N in CFTs whose correlators satisfy ’t Hooft 
large-AI power counting. We take first steps toward verifying that this is so by studying 
the large-A structure of the entropy of spatial regions for a class of CFT states motivated 
by those created from the vacuum by acting with real-time single-trace sources. We show 
that 1/A counting matches bulk predictions, though we require the effect of the source on 
the modular hamiltonian to be non-singular. The magnitude of our sources is eA with e 
fixed-but-small as A —)> oo. Our results also provide a perturbative derivation - without 
relying on the replica trick - of the subleading Faulkner-Lewkowycz-Maldacena correction 
to the Ryu-Takayagi and Hubeny-Rangamani-Takayanagi conjectures at all orders in 1 /A. 
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1 Introduction 

There has been much activity exploring the intriguing connection between entanglement in 
holographic held theories and the gravitational held equations of the bulk dual. This pro¬ 
gram traces its roots to Jacobson’s seminal paper [1], which proposed the Einstein equation 
to be a thermodynamic equation of state for some unknown quantum mechanical system 
in which the area of surfaces measures entanglement entropy across causal horizons. Sev¬ 
eral groups [2-5] have now used related arguments to derive the linearized gravitational 
held equations in the context of the anti-de Sitter/conformal held theory (AdS/CFT) cor¬ 
respondence, where the underlying quantum system is well understood; see also the related 
works [6-8]. A key assumption in these more recent derivations is the Ryu-Takayangi (RT) 
conjecture [9, 10] and its covariant generalization by Hubeny-Rangamani-Takayanagi (HRT) 
[11], both of which relate entanglement entropy of subregions of the held theory to the ge¬ 
ometry of bulk surfaces. A partial converse of this result - that the bulk held equations 
imply aspects of the Ryu-Takayanagi conjecture - has also been argued by Lewkowycz and 
Maldaena [12], though see [13] for a discussion of the so-called homology constraint, [14] for 
questions about the possible role of complex bulk surfaces, and [15] for concerns regarding 
strong time-dependence. 
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Much of the recent discussion has centered on the so-called first law of entanglement 

= d{HA ). ( 1 . 1 ) 

Here H is a subregion of some Cauchy surface for the CFT, Sa ■= ~ Tr(/9yi log(/9A)) is the 
von Neumann entropy of the associated reduced density matrix pA-, Ha ■= — log{pA) is the 
modular Hamiltonian, and 5 denotes the hrst variation with respect to the state when the 
operator Ha is held hxed on the right hand side. The relation (1.1) holds at first order, 
but receives corrections for hnite changes. 

According to the HRT conjecture, in holographic theories the left hand side of (1.1) can 
be extracted from the bulk geometry. The right hand side is typically difficult to evaluate, 
though it reduces to a simple integral of the CFT stress tensor [16] (see also [17]) when A is 
a ball-shaped region and the system is in its global vacuum state. The combination of these 
two results makes (1.1) a useful formula for studying the relationship between entanglement 
and geometry. In particular, in this context (1.1) coincides with the hrst law of black hole 
mechanics in the bulk applied to the Rindler-like Killing horizon dehned by a ball-shaped 
region A on the AdS boundary [17]. This argument for the hrst law can then be inverted 
to derive linearized bulk equations of motion from HRT [2-5]. 

The starting point for our work is the observation (see e.g. [4]) that the entanglement 
hrst law (1.1) is generally useful only for inhnitesimal changes in the quantum state; higher 
order corrections tend to make signihcant contributions when it instead undergoes any sub¬ 
stantial change. In contrast, the bulk hrst law of black hole mechanics accurately describes 
classical deformations - typically involving very large numbers of quanta - so long as the 
changes in entropy and energy are small in comparison with their background values. In 
particular, corrections to the bulk hrst law stem from gravitational back-reaction and are 
thus suppressed by powers of the bulk Newton constant Gjsf. This suggests that corrections 
to the entanglement hrst law (1.1) will be correspondingly suppressed, at least for states 
that would be appropriately semi-classical with respect to the bulk. Our goal below is to 
show suppressions by powers of N in similar computations involving the entropy of spatial 
regions for CFTs whose correlators satisfy’t Hooft power counting and whose spectrum of 
light operators is sufficiently sparse. We consider a class of CFT states motivated by those 
created from the vacuum by acting with real-time single-trace sources, though we require 
the source to have non-singular effects on the modular Hamiltonian. In order to model 
sources that would produce semi-classical coherent states in any bulk dual, the magnitude 
of our sources is taken to be eN with e hxed-but-small as N ^ oo. See also [18] for other 
investigations of entanglement in large N gauge theories. 

For such states, we verify bulk predictions of powers of N via a direct calculation in 
the CFT. Though it differs in detail, the suppression found here is analogous to the large 
N suppression of such corrections found previously in [19, 20]. Our analysis also provide 
additional benefits. First, our explicit formula for second order relative entropy makes 
manifest the agreement with appropriately-integrated bulk stress tensors - here dehned 
to include contributions from the stress tensor of bulk gravitons - required by comparing 
corrections to the bulk and boundary hrst laws [21, 22]. As a result, it again demonstrates 
that a consistent holographic theory of gravity must couple universally to all forms of 
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bulk stress-energy. It also provides a perturbative derivation of the Faulkner-Lewkowycz- 
Maldacena subleading correction to the RT and HRT conjectures at all orders in 1/A^ [23]. 

Before proceeding, we should elaborate on the above restriction to sources with non¬ 
singular action on the modular Hamiltonian. As explained in section 2 below, from a dual 
bulk point of view this requires our perturbations to vanish in some neighborhood of the 
bifurcation surface of our bulk Rindler-like horizon. So the leading-order large-A^ RT or 
HRT entanglement cannot change. They do, however, affect the above-mentioned order 
entanglement. They also change {Ha) and thus the relative entropy Ra = {Ha) — Sa at 
order N‘^. We will show that these powers of N are correct at all orders in e; the fact that 
AS A is of smaller order in N than Sa itself is the 1/N suppression advertised above. 

The rest of the paper is organized as follows. We begin with a brief description of 
our setup in section 2. Section 3 then computes {Ha),Ra and Sa to second order for our 
family of states. It also argues to all orders that the powers of N in {Ha),Ra and Sa 
are precisely those predicted by intuition from a bulk dual. We conclude in section 4 with 
a brief discussion of our results as well as comments on possible extensions. Appendix A 
derives a simple and completely general formula (A. 14) for the second order change in Ra 
in bipartite quantum systems that gives the above-mentioned explicit formulas for Ra and 

A5a. 


2 Setting and assumptions 

We wish to study excitations of the vacuum |0) of a large N CFT in d spacetime dimensions 
on M X S'^~^. Since we take our inspiration from a possible bulk dual, we impose assump¬ 
tions similar to those in e.g. [24] and [25, 26], taking our CFT to satisfy’t Hooft large-A^ 
factorization [27] and to have a sparse spectrum of light operators. As usual, light operators 
are those whose scaling dimension Aj of 0*(x) remains finite as we take N ^ oo and the 
sparse spectrum condition requires that for any fixed A the number of such operators with 
A* < A remains finite at large N. 

The factorization condition states that the set of light single-trace gauge-invariant local 
operators should admit a basis {0*(x)} for which 

( 2 . 1 ) 

where {■ ■ ■)^ is the connected vacuum correlator and similar notation without the subscript 
c will also be employed for the full correlator. Furthermore, heavy operators (those that 
do not remain light as A —)> oo) decouple in the sense that connected correlators involving 
both heavy and light operators are much smaller. In fact, we assume that we study a low 
energy process from which operators of finite-but-large dimension are sufficiently decoupled 
that sums over the O* below always converge. Indeed, our only use of the sparse spectrum 
condition will be to assume that such sums with coefficients of order converge to a result 
of the same order in N. 

It will be convenient to take the basis operators O* to have definite scaling dimension 
Aj at large N and in fact to diagonalize the order term in the connected two-point 
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function. We also require 


( 2 . 2 ) 


(O*(x)>=0, 

for all i, which can be achieved by subtracting appropriate expectation values. Note that 
we have not required the O* to be scalars; we have merely suppressed any tensor or spinor 
indices. Thus, up to the above subtractions, one member (say, O^) of our basis is 1/N 
times the CFT stress tensor which necessarily satisfies A = d; i.e. = ^(T— (T)), where 
we have suppressed the spacetime indices ab. 

Our expectations that corrections to the first law of entanglement are suppressed arise 
from considering the semi-classical behavior of a supposed bulk gravitational dual. Semi- 
classical bulk states can be created from the vacuum through the action of large classical 
sources for the perturbative bulk fields. As usual, we may choose to locate these sources at 
the boundary where they may be translated into sources for the local single-trace O* above. 
Now, single and multi-trace sources mix under time-evolution, but this mixing is again 
controlled by the 1/N expansion: since the stress tensor generates time evolution, to any 
order in 1/A^ a light operator 0{x) can be replaced inside such correlators with an operator 
at another time that is a sum over k of fc-trace terms weighted by Semi-classical 

behavior is preserved in time, so it should suffice to restrict attention to states of the form 

|a) := 17|0) , U := (2.3a) 

j:=Y^N-{k-i) f (2.3b) 

k=i 

where the classical sources ^re fixed smooth c-number functions of order and we 

allow distributional terms so that terms in the /c-trace contributions to (2.3b) may effectively 
include only m < k integrals over operator location. 

We have introduced the real number a to be used as an expansion parameter. The 
symbol T denotes time-ordering, and we employ the convenient abuse of notation that 
defines to be the natural time-ordered exponential associated with the particular 

representation of J given above as an integral over the CFT spacetime. Using the standard 
AdS/CFT dictionary, the above normalizations would give the bulk field (f)'' dual to O® an 
expectation value {(l))a ~ oi- 

We wish to choose a so that (2.3) would behave semi-classically in a bulk gravitational 
dual. Since the bulk is perturbative at large N^ bulk quantum fluctuations become negligible 
in the limit 1 and gravitational back reaction scales like 

cm - ^ . (2.4) 

To allow this effect to be as large as possible consistent with a perturbative treatment we 
take 


a = eN S> 1 

where e is a small parameter to be held constant in the limit N ^ oo. 
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(2.5) 


Our particular interest is in the effect of such sources on a region A of some Cauchy 
surface S in the CFT spacetime, or more generally on the associated domain of dependence 
D{A). The region A is held hxed as we take N —)• oo. We denote the complementary region 
on this Cauchy surface by A^, so that S = A U A'^. As a density matrix, the state |a) 
is cTq, := |Q;)(a!|. The associated reduced density matrices and the un-perturbed modular 
Hamiltonians for A and A^ are then 


PAa — PA TrAc((TQ,) , Ha — log(/JA(o=0)) ) 

PA<^a = PA<^ '■= Tryi(crQ), Ha<^ '■= — log(pA=(o=o)) • (2-6) 

As implied above, we will often suppress the label a on pA, PA‘^ when the meaning is clear, 
though Ha, Ha<^ will always represent the modular Hamiltonians at a = 0. An important 
property of these objects is 

[Ha,HaA = Q, (2.7) 

where both Ha and Ha<^ are interpreted as operators on the full CFT Hilbert space by 
tensoring each with the identity operator or lyi) on the complementary region. The 
result (2.7) would be immediate for truly bipartite systems that decompose as a tensor 
product of a system on A with a system on A'^. It is less trivial in quantum held theory, 
but still holds for ball-shaped regions A in CFTs since the vacuum is invariant under a Z2 
conformal symmetry that exchanges A and This symmetry must then exchange Ha 
and Ha^ as well, requiring their commutator to be symmetric. The manifest antisymmetry 
of [Ha, then forces the commutator to vanish. 

It is also useful to dehne the operator K := Ha — Ha<^- Recall that K annihilates the 
vacuum: 


K\0) = {Ha - HA^m = 0 . (2.8) 

For ball-shaped regions A in a constant-time slice, K generates the conformal isometry of 
M ® 5'^-^ that moves A orthogonally to itself while preserving the domain of dependence 
D(A); i.e., it generates the natural forward “time-translation” on D{A) and the natural 
backward “time-translation” on D{A^) [16]. For lack of a better name, we will refer to K 
as the boost Hamiltonian, though this term is only an accurate description of K in the 
large-sphere limit when A becomes a half-plane. 

Finally, we introduce a basis for the space of light local gauge-invariant single trace 
operators {0\{x)} {{0\c{x)}) on the domain of dependence D{A) {D{A^)) that again 
satisfy (2.1) and (2.2). Below we will consider ball-shaped regions in a constant-time Cauchy 
surface S = S'^~^ so that the modular Hamiltonians Ha, Ha<^ can be expressed as integrals 
of the stress tensor over A, A^ in any CFT [16]. 

2.1 Adapting the source 

A key step in our argument will be to write the source-operator J from (2.3b) in a manner 
adapted to the decomposition S = AuA'^. The basic idea is to use the Heisenberg equations 
of motion for the CFT to express J in terms of operators on S. Thinking of the CFT Hilbert 
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space as a tensor product of separate Hilbert spaces for A and A'^ would then allow J to 
be written as a sum of terms, each of which is the tensor product of a (possibly trivial) 
operator on A with one on A^. 

J = (2.9) 

a 

We would like to then expand the J^, as a sum or products of the {0\{x)'\ ({0(^c(x)}) 
and use (2.1) to count powers of N. 

There are, however, several potential obstacles to consider. First, evolving the source to 
the Cauchy surface S will generally mix the Oi with both heavy operators and highly non¬ 
local expressions such as Wilson loops that cannot be expanded in terms of the {O)^(x)} 
{{0\c{x)}). However, our calculations below will involve only correlation functions of 
sources with For the ball-shaped regions to be considered, up to the choice of 

zero-point our Ha, Ha^^ are integrals of the stress tensor, which is one of our light operators. 
Since the stress tensor also generates time evolution, to any order in 1/N a light operator 
0(x) can be replaced inside such correlators with an appropriately-smeared sum of products 
of {Ol^(x)} {{0\c{x)}). So in this sense we may write 

J = Oa + [ dxi.. .dxkjii...ikixi,... ,xk)0'‘^{xi).. .0'‘'‘{xi), 

N J[DiA)uDiA-)]'^ 

( 2 . 10 ) 

where we have used the fact that operator mixing is suppressed by powers of 1/A^ and 
defined 

Oa= [ j^'{xm{x), Oa^= [ jfix)0\.ix). (2.11) 

Jd{A) Jd{A‘^) 

The combinations of operators represented by Oa, will play important roles below. 
Although not explicitly indicated, we will make use of a similar decomposition of the multi¬ 
trace parts of (2.10) into sums of products of operators associated separately with A and 
A'^. Since (2.3) requires time-ordered products of such operators, we should really perform 
an expansion of the form (2.10) to express the source J(t} at each time in terms of operators 
that evolve to the regions A, A‘^, but to avoid clutter in our notation we will not explicitly 
indicate the time at which given operators are to act. 

In terms of a holographic bulk dual, the rewriting of (2.3) as (2.10) could be described 
as evolving sources to a bulk Cauchy surface intersecting the boundary at A U and then 
using e.g. the methods of [28-34] to write the corresponding bulk operators in terms of 
boundary operators^ in D{A),D{A^). At least for operators O* not associated with bulk 
gauge fields, this interpretation nicely side-steps issues (see e.g. [35-44]) associated with the 
expectation that our CFT will be a gauge theory so that its Gauss-law constraint forbids 
factorization of its Hilbert space into separate Hilbert spaces for A and A'^. 

However, even without a Gauss-law constraint, quantum field theory Hilbert spaces do 
not admit a precise tensor product structure. This fact is associated with singularities in 

^ Since our analysis is perturbative, this construction may be performed in empty AdS so that ball-shaped 
regions A, A'^ define Rindler-AdS regions with a common bifurcate Rindler horizon. 
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various n-point correlation functions which can give contributions localized precisely on the 
boundary dA where A and A^ meet.^ The issue could be ignored if we were to consider 
only correlators smeared with smooth functions; we could simply deform the smearing 
functions so that they vanish in some small neighborhood of dA and then recover the 
original undeformed correlators by continuity as these neighborhoods shrink to zero size. 
But the fact that Ha,Hac are integrals of the stress tensor against non-smooth functions^ 
means that more care will be required. In section 3 below we simply restrict to sources that 
induce changes in energy A{Ha) and entropy AS a that can be approximated by replacing 
each term in (2.10) with a source that differs from the original in some neighborhood ^collar 
of dA and vanishes smoothly in a smaller neighborhood ^source-free of dA, and then letting 
the width of Acoiiar vanish. Roughly speaking, these are the sources that do not produce 
distributional terms localized at dA. We refer to such sources as having non-singular action 
on the modular Hamiltonian Ha- 

In practice, rather than working through the above limit explicitly, we will simply 
consider the regulated operators mentioned above which we take to vanish in a common 
neighborhood ^source-free of dA. The limit ^collar) ^source-free 0 will be left implicit. Such 
sources may for example be obtained by choosing the original stress-tensor sources to be 
supported in the interior of D{A) U D{A‘^). Nevertheless, the fact that the multi-trace 
terms in either (2.3) or (2.10) are multi-local, means that they can include products of 
operators in D{A) with those in D{A^). Thus U is generally not a product of separate 
unitary transformations on D{A) and D{A'^). 

3 Results with no singular terms 

We now study the changes in energy and entropy associated with applying U to |0) for the 
above sources. 

3.1 Energy 

Changes in the energy are straightforward to evaluate and take the form 

A{Ha) = {fe^^^HATe-^^-^ - Ha) 

= ieN{[J,HA]) - [J,Ha]]t) + 0{e^N^), (3.1) 

where T denotes anti-time-ordering and [J, [J, Ha]]t is the operation defined by the second 
order expansion of the first line; as indicated by the notation, one may think of this term 
as an appropriately time-ordered version of a double commutator. 

Since the first order term also clearly follows from the first line, it remains only to show 
that the omitted terms are of order and in particular involve no higher powers of 

N. We expect that this argument is also standard, but we state it here for completeness. 
The point is to note the repeated commutator structure of THaT— Ha, which 

^In the bulk description mentioned above, it is associated with the fact that operators at the bifurcation 
surface of the Rindler horizon cannot be mapped to either the D{A) or D{A^). 

®The first derivative of the smearing function is discontinuous at dA. 
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requires that at order a"' all n sources must be connected to each other and also to Ha- 
Considering for the moment only the single-trace contribution Oa + Oa<^ to all sources J 
from (2.10) and using (2.1) then gives only terms of order 

One may then show that multi-trace contributions are further suppressed by at least 
an additional N~'^: Since the coefficient of each multi-trace contribution to (2.10) contains 
an explicit factor of 1/A^, the only possible exception could come from including a single 
double-trace contribution 0*10*2 to one of the sources. But since (0*^) = (O®^) = 0, the 
repeated commutator structure again means that all that all non-zero contributions are fully 
connected. From (2.1), the replacement of a single-trace operator by O^iO®^ thus yields an 
extra 1/A^ in this connected correlator giving the stated suppression by two powers of 1/N. 

We should also comment further on the linear term in (3.1). Recall that we take sources 
in (2.10) to vanish near dA. This must in particular be true of the single-trace term, which 
can thus be written as Oa + Oa<^- Writing Ha as an integral of NO^ (plus a constant) 
and using (2.1) would suggest that this single-trace part makes the linear term of order 
eN"^. But the Oa<= term clearly commutes with Ha- And since [Oa,Ha<^] = 0 we have 
[Oa,Ha] = [Oa,K]. The fact that K annihilates the vacuum then implies {[Oa, Ha]) = 0. 
As a result, the linear term in (3.1) receives contributions only from multi-trace terms in 
(2.10) and is thus suppressed by an extra N~‘^ as described above. Thus the linear term is 
in fact of order eN^- 

3.2 Entropy 

Computing changes in entropy and relative entropy is more complicated than computing 
A{Ha)- The main issue is that expanding the logarithm in Sa requires using the highly non¬ 
trivial Baker-Campbell-Hausdorf (BCH) formula. Nevertheless, subject to the assumption 
that all sources in (2.10) vanish near dA, we will argue below that changes in the entropy 
Sa and relative entropy Ra take the form 

Ra ■-= A{Ha) - AS A = {e^^ApA[f{K) - f{K)]ApAe^^) + 0{e^N^) , (3.2a) 

= e'^N'^iOAKOA) + 0{e^N^) + Oie^N^) (3.2b) 

ASa = 0{eN°) (3.2c) 

where ApA = PA — PAO and / is the smooth function 

m = ■ ( 3 . 3 ) 

This / appears because it is the generating function of the Bernoulli numbers which play 
an important role in the BCH formula [45]. While it is generally redundant to describe 
{Ha), Sa, and Ra '-= {Ha) — Sa, we choose to do so ~ and in fact give two expressions 
for Ra ~ both in order to state definite results and to describe the relative sizes of various 
contributions. For example, the explicit term in (3.2a) is a general result for the second 
order term in the relative entropy Ra of any bipartite quantum system,^ but the large N 

^While continuum field theories are not truly bipartite, we may nevertheless use this formula due to our 
assumption that sources vanish near dA. See appendix A for details. 
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structure is more apparent from (3.2b). Since our current interest focuses on the latter, 
and since (3.2b) may be derived from more general arguments, we relegate the calculation 
leading to (3.2a) to appendix A. Equation (3.2a) may nevertheless be useful for applications 
that require subleading terms of order as well as the leading term of order 

Although deriving these results will require some work, the intuition behind (3.2b) and 
(3.2c) is easy to understand. Note that keeping only the single-trace term would make 
U a product of separate (commuting) unitary transformations and on A and A^. 
This would require = 0 and thus Ra = A{Ha)- So contributions to require 

the multi-trace terms in (2.10), which are of order N^. Deriving (3.2c) thus amounts to 
controlling cross-terms involving both single-trace and multi-trace source terms. This is 
done in section 3.3 below. 

The explicit term in (3.2b) is the second-order effect of purely single-trace sources on 
Ra, or equivalently on {Ha)- Since either can depend only on the restriction of the state 
to A, they are unchanged by the action of the defined above. We may thus consider 
only U^, which commutes with Ha<^ and thus induces identical changes in both Ha and K. 
Indeed, since the linear term in (3.1) vanishes for single-trace sources that vanish near dA, 
it suffices to compute only the second-order term 

-"^{[Oa,[Oa,K]]t). (3.4) 

Using Ar|0) = (OjAT = 0 then shows the only non-zero term to be the one displayed in 
(3.2b). Along with the fact that the first order change in Ra vanishes identically by the 
first law, the errors in (3.1) and (3.2c) then imply those in (3.2b). 

3.3 Multi-trace contributions to AS a 

The task that remains is to show that including multi-trace contributions to (2.10) can 
change Sa only by terms of order N^. The argument is somewhat lengthy, so we break it 
into several parts. We first reorganize the action of the single-trace source-terms in order to 
show that they have little impact. We then work to write a series expansion of Sa to which 
we can usefully apply the large-A counting rule (2.1). Here there are two difficulties, one of 
which is associated with the fact that Sa is defined as a non-linear function of the reduced 
density matrix pA on region A and not as a vacuum expectation value. Any power series 
expansion of Sa thus naturally involves many traces over A^ that must be eliminated in 
order to use (2.1). The other involves controlling contributions from possible disconnected 
correlators. After completing these tasks, we combine the results and count powers of N. 

3.3.1 Reorganizing the action of single-trace sources 

To begin, recall the definitions 

PA = Tta^ = TrA= - (3.5) 

Recall also that dropping the multi-trace contributions and computing only 

Cringle = 
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(3.6) 



would give a product of separate unitary transformations on A and that do not change 
Sa- Similarly, we may note that the entropy of pA must be identical to the entropy of 

pT = Ttac (C/T4i,a„f/Single) = [C/±gle]“(3.7) 
where is the part of C/singie that acts on A. We thus have 

SA=-Tt^(/>riogpr). p-8) 

Now, ctq. itself is defined (see (2.3)) by conjugating |0)(0| with a time-ordered unitary 
U. This unitary is the product of many factors of the form 

g iaj{t)5t _ g g (3 9) 

where we have separated the single- and multi-trace parts of the source at time t. Note that 
our new conjugation by C/singie merely replaces the U in (2.3) by or equivalently 

replaces each factor (3.9) with 

(3.10) 

where ■7^{i(t) is a multi-trace term conjugated by a time-ordered exponential built from 
single-trace sources at earlier times; there is no effect on (3.9) from sources at later times 
as these merely cancel between and U. One may make an analogy between U and 

the time-ordered exponential that implements Heisenberg-picture time-evolution in some 
quantum system, with Jsingie playhig the role of the free Hamiltonian and Jmulti playing the 
role of the interaction terms. Our conjugation by t4ingie then plays the role of passing to 
the interaction picture, where the new time evolution is a product of factors like (3.10). 

We wish to write 

pr =Pgo(l+PijAp“"J) (3.11) 

for 

^pT = pT - PAO = TVap 

and to expand (3.8) in powers of We then expand in powers of a. In 

this latter step will write each Ap™“^ as a sum of terms of the form 

1^3' (iciii-i. i4”;i- 2 ■ ■ ■ iciii-i. i»>{oiii ■■■!). (3.13) 

where is a time integral of and the factors are appropriately time-ordered. 

It is important to note that |0)(0| is not conjugated by any such unitary; the privileged 
position of this operator at the end of the chain of repeated commutators means that, in our 
analogy with transforming between the Heisenberg and interaction pictures, this operator 
can be thought of as labeled by the earliest possible time so that the picture-changing 
transformation acts on it trivially. The point of the form (3.13) is that we will shortly (see 
section 3.3.2) transform the expansion of (3.8) into a form involving only vacuum correlators 
of products of operators. The repeated commutator structure of will then forbid 

single-trace terms from appearing in correlators unless they are appropriately connected to 
multi-trace terms. 
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3.3.2 The entropy as a correlator 

The next step in onr argnment is to show how Sa can be written as a snm of vacnnm 
correlators of prodncts of the O* with fnnctions of the boost Hamiltonian K, and where the 
remaining iV-dependence of each term follows directly from powers of a = eN and (2.10). In 
particnlar, all extra traces over will be removed, and the modnlar Hamiltonians Ha, Ha<^ 
will not appear except in the combination K = Ha — Ha^^ ■ This last featnre will be critical 
in controlling contributions from disconnected correlators. 

As a first step toward this goal, we may nse (3.11) to express Sa as the vacnnm 
correlator 


Sa 


-Tr^ 


pao[i + p-A^pT]^^^p7 


^conj 


(i+piApr)iogpr), 


(3.14) 


where operators on region A are to be interpreted as operators on the fnll CFT Hilbert space 
by tensoring them with the identity on A'^. Note that log^p™’^^ <8) 11 a=^ = + 

log 1 Ac = flog Pa may interpret log/?“"l as the logarithm of an operator on 

the fnll Hilbert space. 

We next remove the explicit traces over These enter throngh the definition (2.6), 
and are potentially problematic becanse the sonrces involve prodncts of operators on A 
with operators on A^. The trick to proceeding is to nse the assnmption that each term in 
(2.10) is snpported away from dA (so that contribntions to each sonrce-term from A and 
A^ commnte with each other) to write each term (3.13) as a snm of terms in which all 
operators on A'^ have been commnted to act directly either on |0) from the left or on (0| 
from the right. 

Using the entanglement properties of |0), we may now replace each operator in A^ by 
a so-called “mirror operator” on A; see e.g. [46], thongh the particnlar terminology is from 
the more recent [47]. To explain how this works, let ns for the moment take A to be the 
“sonthern” hemisphere of onr S'^~^ at t = 0. In any relativistic theory, given an operator 
O, not necessarily a scalar or even local, we may stndy the CPT conjngate operator . 

Since the parity operation exchanges the north and sonth hemispheres, the CPT conjngate 
of any northern hemisphere operator at t = 0 (i.e., on A'^) is an operator on the t = 0 
sonthern hemisphere A. Fnrthermore, these operators satisfy 


= e^/^O\0); (3.15) 


where the second relation is jnst the Hermitian conjngate of the first. These are jnst Knbo- 
Martin-Schwinger (KMS) relations in terms of the imaginary time evolntion generated by 
onr K, so they encode the thermal natnre of |0) with respect to Ha- Thns the mirror 
operator O := ^-^KI 2 satisfies 

O|0) = C>|0); (0|Ot = (0|C>te^. (3.16) 

The key point is that the CPT conjngate of an O* is jnst another (or perhaps 
a linear combination thereof). So at the expense of introdncing additional factors of 
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^ we may replace the operators acting on (Tq by CPT-conjugate 
operators acting on A. Conformal invariance then guarantees that we can again perform 
a corresponding operation to replace operators on AA with those on A for more general 
ball-shaped regions. The new operators may then be commuted past ^d-operators 

to act on (To, where they may be replaced by factors of using (2.8). The net result is 

thus to write as a sum of terms of the form 

Tr^c . 

(3.17) 

Operators supported away from AA can now be pulled outside the trace over AA. By 
assumption these include all but we should take care with the factors of which 

include support near dA. We may do so in each term by using the Zassenhaus formula 

gX+y ^ gXgyg-[x,y]/2gi(2[y,[x,y]]+[x,[x,Y]]) _ _ _ 

for 77^/2 = A -|- y, with X = i7^''/2 a Hermitian integral of the subtracted stress tensor 
NO^ weighted by a smooth function supported away from dA and Y a similar integral 
supported close enough to dA to avoid overlap with the support of any appearing in 
the given term. Since both are smooth integrals, the supports of X and Y will overlap with 
each other and thus yield non-zero commutators in (3.18). It is convenient to collect all 
factors involving Y into a single operator Note that commutators of X,Y receive 

contributions only from regions in the support of both operators, so that is again 

supported in a region close to dA that avoids overlap with the support of any appearing 
in the given term. 

We now have 

g+H^/2 ^ g+//J“/ 2 g+H(f 72 
g-H^/2 ^ g-//““V2g-H(fV2^ 
g+HA/2 ^ g+//^“/2g+H““V2^ 

where (3.19b) is the inverse of (3.19a) and the final two relations (3.19c), (3.19d) are the 
adjoints of (3.19a), (3.19b). The expressions (3.19) allow us to safely reformulate (3.17) as 

X Tr^c (e-^“"/2g+Hj-/2g+H(f72|o)(o|e+^A”/2g+^^r“/2g-^/r“/2^ 

X (3.20) 

Since is the inverse of these operators all cancel. The remaining factors 

of may then be extracted from the trace as well. Reversing the steps involving 

j^near,^far reconstructs the original factors of outside the trace, leaving just 

(3.17) where the trace acts only on |0)(0| as one would naively expect. Using Tr^c |0)(0| = 
then removes all explicit traces over AY. While we could take this final factor of 


(3.19a) 

(3.19b) 

(3.19c) 

(3.19d) 
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to cancel the two factors of adjacent to |0)(0| in (3.17), due to the structure of 
(3.13) it is convenient to instead keep all of these factors explicit. The latest factor of 
coming from the trace is distinguished, and we shall refer to it below as the ‘final’ . 

The above steps have written each factor of Ap™"-’ as a product of 0\ with two factors 
of two factors of and our final e~^^. But as noted above we wish to expand 

in powers of We may then use 

1 = ^Hac^-Hac^-HacI2^-HacI2^HacI2^HacI2 /g 


to insert the indicated factors of Commuting them past the and exponentials 

of Ha as needed allows us to replace all exponentials of Ha with exponentials oi K = 
Ha — Ha<^- Ill particular, in this way we obtain a distinguished ‘final’ factor of e~^. 

We are now close to achieving our goal. If p(^gAp™“'' would commute with pAo, we 
could use logp™'^-’ — logpyio = log^l + Pao^P^A^^') expand (3.14) in a standard Taylor 
series. Combined with our results above, this would give an infinite sum of terms with each 
being a vacuum correlator of a product of Oi and exponentials of K, consistent with the 
form required above. ^ 

But since pjjgAp^^-' and pAo generally do not commute we must use the Baker- 
Campbell-Hausdorff formula to evaluate log = log 
formulation of this identity due to Dynkin takes the form (see e.g. [48]) 


PAoil + PAlApr) 


■ 


An explicit 


log(e^e^) 


E 

k=l 


(- 1 ) 


k-1 


E 

Pi+gi>i 


(Eti(K+«))(nt,K!«!)’ 


(3.22) 


where 




= [A,[A,...[A, [y,...[y, [A,...[A, [y,[y,...y].. .] . ( 3 . 23 ) 


Pi 2Cs 


<?i 


Vs 


Pk Xs 


Qk Ys 


We wish to set A = —Ha = logp^o and Y = log^l + p^gAp™"^^. 

The important observation is that the form of p^gAp^ found above clearly commutes 
with Ha^^, so [Ha<^, y] = 0 as well. Recalling from (2.7) that A = —Ha also commutes with 
Ha'^ allows us to replace each A in the above repeated commutators with —K = Ha‘^ — Ha- 
The result is an expression for Sa as an infinite sum over terms, each of which is a vacuum 
correlator involving only products of and functions of K, and with all further explicit 
dependence on N coming from (2.10) as desired. 


3.3.3 Counting powers of N 

We are now ready to count powers of N. Consider then contributions at order involving 
r single-trace operators Ofrom single that come from single-trace source-terms and s single¬ 
trace operators Ofrom multi that come from multi-trace source-terms. Such a contribution 

®With the exception of the term Ap“"Wyi). This term resembles the mulit-trace contributions to 
A{logpyio) considered in section 3.1 and is also of order for similar reasons. 
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is equal to a product of connected correlators each with r* of the Ofrom single and Si of the 
dfrom multi such that r = ri and s = YZi need not keep track of the factors of 

K since each contributes an explicit factor of N that is cancelled by the extra 1/N from 
(2.1) associated with the it contributes to any correlator. Each connected correlator 
gives ri + Si — 2 factors of 1/N by (2.1). The single-trace sources provide an additional r 
explicit powers of N. For double-trace sources the explicit N from a = eN cancels against 
the l/N in (2.10) and for higher-trace sources the contribution is more suppressed. So the 
total number of factors of l/N is greater than or equal to ~ 2 )- We wish to show 

that this sum is non-negative. 

Recall that the reason multi-trace sources affect the entropy is that they can contain 
products of operators in A with operators in A^. But a given term will only be sensitive to 
the correlations created by these sources if at least one operator from A and one from A'^ 
appear in the same connected correlator. So it is natural to expect that the entropy will 
only receives contributions from terms with s* > 2 for all i, which would imply the desired 
result. 

To show that this is so, we note that there are no contributions from terms with Si = 0. 
This is because all of the O^om single occur in nested commutators inside some and 

therefore must be connected to at least one Ofrom multi in order to contribute. 

Now suppose Si = 1, where we take the relevant Ofrom muiti-j to come from a m-trace 
source term living inside some Since (Ofrom multi-j) = 0, a non-vanishing connected 

correlator must involve other operators. None of these can come from single-trace source 
terms in other for k ^ j, which must instead stay attached to their own multi-trace 

sources. So since Sj = 1, if any of these operators come from a final K then the correlator 
must vanish due to ii'IO) = 0 = (Ol/C and the fact that (3.13) prohibits a final K from 
intervening between factors coming from any given So the correlator consists only 

of Ofrom multi-j) single-trace source-terms, and K's coming from a single Terms of 

this form exponentiate, and have the effect of replacing the operator Ofrom multi-j in 
with a classical source of order N, demoting to an (m — l)-trace source of order 

jy-m-3 same order as the {m — l)-trace sources already appearing in (2.10)). So 

we need not consider such terms separately when counting powers of N. In particular, 
double-trace sources demoted to single-trace sources in this way can be absorbed into the 
t^singie of Section 3.3.1 and do not contribute to the entropy. So all relevant terms have 
Si > 2 and the largest possible contribution to the entropy is of order as anticipated 
above. 

4 Discussion 

The work above contains first steps toward studying the von Neumann entropy of excited 
states in CFTs satisfying the ’t Hooft large N counting rule (2.1). We considered the 
entropy of ball-shaped regions for states produced by real-time sources with non-singular 
action on the modular Hamiltonian and of the form (2.10). From a dual bulk point of view, 
such sources produce small classical waves - or more properly quantum coherent states 
with large amplitude of order eN - on both sides of a Rindler horizon but which do not 
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disturb the bifurcation surface itself. They also add 0{N^) entangled sets of particles across 
the horizon. As the former does not change the HRT entropy, one expects that the CFT 
entanglement changes only at order We have verified that this is indeed the case by a 
direct argument in the CFT. While our results are directly formulated in terms of a 1/N 
expansion, it may be interesting to follow [5] and attempt to formulate a version of our 
results that would hold in arbitrary CFTs, whether or not they have a large-A^ counting 
rule like (2.1).® 

One consequence of our work is the explicit formula (3.2b) for the order relative 
entropy. When interpreted in the bulk theory, this formula is precisely the bulk stress tensor 
on the A side of the bulk horizon, integrated so as to give the associated change in the boost 
energy K. This may be seen from the fact that (3.2b) is just the contribution to K from 
the excitations on the A side of the bifurcation surface, and by bulk causality that it thus 
gives the value of the boost Hamiltonian AT in a related bulk solution produced from the 
vacuum using only the sources Oa in the causal past of A. The above claim then follows by 
writing K in terms of a standard ‘bulk stress tensor’ that includes quadratic contributions 
from gravitons as well. For concreteness, one may choose the graviton contribution to be 
given by the AdS analogue of the Landau-Lifshitz ‘pseudo-tensor’ expression [49], though 
our requirement that perturbations vanish near the bifurcation surface means that many 
other choices give equivalent results^. 

The above observation means that our analysis provides a new argument for the univer¬ 
sal coupling of gravity to all classical forms of stress-energy. Though it differs in detail from 
[4], this interpretation of our classical {0{N‘^)) result is inspired by the quantum {0{N^)) 
argument of that reference and reinforces the connection found there between the universal 
structure of variations in Sa and the universal coupling of bulk gravity. 

Another consequence is to give a derivation of the order Faulkner-Lewkowycz- 
Maldacena correction [23] to the Ryu-Takayagi and Hubeny-Rangamani-Takayanagi con¬ 
jectures (for ball-shaped regions A). In the semi-classical bulk, this argument is perturbative 
in departures from empty AdS. It thus complements the original reasoning in [23] in that 
it does not rely on the Lewkowycz-Maldacena argument [12], or on any other use of the 
replica trick. At first order in e the result follows from the first law, but the arguments 
of section 3 also give this result at higher orders. In general, we may decompose Sa into 
a first-law-piece and Ra- Recall that at second order Ra is quadratic in the first order 
contribution to and in particular is given by 

- z«TVAc([Jdoubie, |0)(0|]) = TrAc(e-'“'^“|0)(0|e'“''“ - |0) (0|) + 0{e^), (4.1) 

where Jdouble is the double-trace part of (2.10). As before the only terms in Jdouble flint 
can contribute to IS.Sa are those that involve one operator in A and one in A'’. Such terms 
throw entangled pairs into the bulk, with one member of each pair on each side of the bulk 

®We thank Tom Faulkner for discussion on this point. 

^There is, however, a preferred expression when perturbations do not vanish at the bifurcation surface. 
The bulk relative entropy Ha — S' of the classical bulk is then the so-called canonical energy density of [50], 
which is naturally expressed as an integral of what one may call a bulk stress tensor. See [51] for further 
discussion of this connection. 
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horizon. Given the agreement of bulk and CFT vacuum correlators, it is manifest that the 
corresponding change in bulk entanglement at this order can be computed just as we have 
done in the CFT and that the results agree. A similar argument holds at higher orders in 
e, where the repeated commutator structure of (3.13) reproduces the effect of propagating 
small effects from the multi-trace sources through the large semi-classical coherent state 
produced by the single-trace parts of (2.10). It also extends to higher orders in N to argue 
that bulk entanglement gives the full series of 1/N corrections to HRT. Note, however, that 
since our sources are confined to D{A) U D{A^) all perturbations vanish at the bulk HRT 
surface. Our derivation is thus insensitive to possible perturbative shifts of the HRT surface 
of the kind predicted in [52]. Conversely, extending our results to allow sources supported 
on dA would in principle allow us to test the conjecture of [52] that the full CFT entropy is 
given by the generalized entropy of a quantum extremal surface, defined as the bulk surface 
that extermizes the bulk generalized entropy. 

We have focussed on ball-shaped regions for simplicity, but we expect our arguments to 
generalize to arbitrary regions A. Indeed, as described in [53], one may address perturbative 
deformations of ball-shaped regions by inserting additional factors of the stress tensor (or, 
equivalently, of NO^). These insertions tend to add another operator to each connected 
correlator, giving an extra \/N from (2.1) that cancels the explicit new factor of N. A 
similar argument can be used to compute corrections to (3.15) and construct the relevant 
“mirror operators” for these deformed regions. So the only ingredients of our analysis that 
remain to be checked are that Ha, Ha<= can be thought of as operators on the full CFT 
Hilbert space that commute both with each other, and with local operators supported away 
from dA. It should be possible to analyze these assumptions perturbatively as well. We 
expect that these will indeed hold at this level, but verifying this is beyond the scope of 
our work. 

The extension to include sources supported near dA would clearly be of great interest. 
At linear order in e the contribution to Sa from such sources is governed by the first law 
and was effectively studied in [2, 3]. But at this order there is no displacement of the bulk 
extremal surface. In particular, for ball-shaped regions A the bulk HRT surface continues to 
coincide with the bifurcation surface of the corresponding event horizon; i.e., with the causal 
holographic information surface of [54], which is expected to compute some coarse-grained 
version of the CFT entropy [54-56]. In contrast, the two are distinguished at second order, 
so comparing bulk and CFT computations may give insight into the nature of the relevant 
coarse-graining. In particular, one might hope to either support or falsify the conjecture 
[56] that it corresponds to the maximizing the entropy over all states for which certain 
one-point functions coincide with the original state. 

In addition, comparing bulk and CFT computations to second order would derive or 
falsify the HRT conjecture at a non-trivial level. Even for static perturbations the fact 
that it avoids the replica trick would make this a useful complement to the Lewkowycz- 
Maldacena argument [12], and the perturbative method should be able to address general 
time-dependence to which [12] does not apply. And since second-order results are sensitive 
to the displacement of the HRT surface, they could in particular detect any possible motion 
of this surface in imaginary directions within the complexified AdS spacetime; i.e., they 
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could help diagnose the possible role of complex extremal surfaces as explored in [14], 

One reason that we have avoided such singular terms here is that they are in principle 
sensitive to the particular way that entanglement is to be defined in the CFT. Since the 
CFT is a gauge theory, this can involve a number of subtle issues [35-44]). Lewkowycz- 
Maldacena [12] suggests that the correct notion of CFT entropy is defined by the replica 
trick, which is precisely the computational tool we wish to avoid. However, the results of 
[35-44]) also suggest that the various definitions of entropy differ only by a local boundary 
term that will cancel in computing the mutual information between pairs of regions A, B. 
One should thus be able to ignore such concerns in this context. We hope to compute the 
‘singular’ second order terms and to explore the above issues in the near future, perhaps 
using a suitably-generalized version of the calculation in appendix A. 
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A Computing the relative entropy 

This appendix derives an explicit formula for the second order change 6^Ra in the relative 
entropy Ra corresponding to an arbitrary change 6pA in the reduced density matrix for 
A. This result is not directly used in the main text, other than writing (3.2a). Our final 
expression (A.14) bears a striking resemblance to Eq. (C7) of [5]. In fact, (A.14) can also be 
derived from a straightforward generalization of the calculation leading to (C7). We present 
a different, somewhat more involved calculation here because we hope that this approach 
will be useful for analyzing the additional terms at order that arise when the sources in 
(2.10) do not vanish in a neighborhood of dA. 

The setting for the calculation below is an arbitrary bipartite quantum system, meaning 
that it is the tensor product of a system on A and one on A^. As discussed in the main text, 
the actual system we study is not strictly of this form, though it can be treated as such 
at least under our assumption that each term in (2.10) vanishes in a neighborhood of dA. 
So a critical step in realizing the hope expressed in the paragraph above is understanding 
modifications that arise when this assumption fails. 


17 - 


A.l The Baker—Campbell—Hausdorff Formula 

We will compute S'^Ra using the BCH formula in the form (3.22). Since we work only to 
order it will be useful to rewrite (3.22) as 

oo oo n—1 

log(e^e^) =X + Y + Y^ + + o{Y^) , 

n=l n=lfc=0 

(A.l) 


where Cn and are rational numbers. It was shown in [45] that 

(A 

where Bn are the hrst Bernoulli numbers {Bq = 1, i?i = —1/2,...). The Bernoulli numbers 
can be dehned by the smooth generating function 


Cn = 


{-iTBn 


n: 


f{y) ■= 


y 

1 - e-y 


^ p 

Y^(-yy 

n=0 


(A.3) 


where the sum on the right hand side converges for y G (—27r,27r). Note that i? 2 m+i = 0 
for m > 1. 

To compute 5‘^Ra we will also need the coefficients Dnfl- To our knowledge these 
coefficients have not previously been explicitly computed. But a straightforward application 
of the recursive technique developed in [45] yields® 


Dnfi — — 


2 (n + l): 


72+1 

E(-i) 


n—k 


k=l 


n + 1 
k 


Bk B, 


B. 


k^n+l—k — 


72+1 


2 (n!) 


(A.5) 


To obtain the second equality hrst note that for even n the only non-vanishing terms in 
the sum are k = 1 and k = n, and that these terms cancel. For n = 1 the equality is 
easily checked by hand. For odd n > 3, only the even k terms survive so ( —l)””^ = 
— 1. The remaining sum is easily evaluated using a well known identity due to Euler and 
independently rediscovered by Ramanujan (see for example Eq. (1.2) in [57]) 


' \ [2m\ 

^ ^ J = -‘^rn B2m , m > 2 . (A.6) 


^Iterating Eq. (12) of [45] in such a way as to obtain all terms with two T’s gives 


log(e-^e^) = X + y + 


oo „1 / oo \ 

Y^Rk dti Adx + tAdy + ^ 7?„[Adx^"*’tAdy] + ... y 
k^l "'0 V m=l / 


(A.4) 


where R„ = (—1)”Bn/n\ and AdzlF = IZ,W]. After collecting all of the the terms that give nested 
commutators of the form we obtain the first equality in (A.5). 
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A.2 Computing 8^Ra 

Taking a second variation of Ra gives 


5^Ra = Tr(5pA(5[log(yOA)]) + Tr(pA<5^[log(/9A)]) • (A.7) 

To evaluate the variation of the logarithm we first rewrite 

log(pA + 8pa) = log[e"'^'^(l + e^^dpA)] (A.8) 

and apply (A.l) with X = —Ha and Y = log(l + e^^SpA)- We then Taylor expand Y 
using 

log(l + Z) = Z - ^ + 0(Z3). (A.9) 

None of the Dn,k>i terms from (A.l) contribute to (A.7) because they can only appear in 
the second term of (A.7) and 


Tv(^PA[H‘^-^\p-/SpA)^^^H^r"\p~A"^PA)^^'^ 


which vanishes by cyclicity of the trace. The remaining terms give 


/ Br, 


-Tvi^A^Spl) 

n=l '' 


Ra 


H 


(n) 


Spa’ ^Pa) 


n=l 


Note that when [Ha,8pa] = 0, (A.11) reduces to the term studied in [53]. 

We now evaluate the sums in (A. 11). It is useful to rewrite the first trace as 


Tr e 


Ra 


H 


(n) 


n / ^ 

5p^I^]5pa) = E(-l)V) ^{e^^Hl-^5pAH{5pA) 

j=0 

n /X 

= Y.^-iyr.){e^^Hl-^5pAH^A^pAe^^) 


= {{e^^6pA){-Kr{6pAe^^)), 


(A. 10) 


OO 

+ J2{-y"Dn,oTt[[{e^^f^H^A^6pA]SpA) • (A.ll) 


(A.12) 


where the first line is straightforward to prove by induction. In the second equality we have 
inserted a factor of e^^pA = 1 into the trace to convert it to an expectation value with 
respect to the reduced density matrix pA- Since all of the operators inside the expectation 
value live on A, this is equivalent to the expectation value with respect to the full density 
matrix p = |0) (0|, which, as in the main text, is denoted by {■■■)■ The last equality follows 
from Ha\0) = 77a=| 0), the definition K := Ha — Ha<^-, and the binomial expansion. 
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Similarly, the second trace in (A. 11) can be rewritten in the form 







= E(- 1)' ( j) (( I®"" ('Sp/1«"")) 

j=0 ^ 

= [(- 1 )” - \\{{fHpA)K^{ppAe -^^)), 


(^{e^^5pA)H'l ^H^j^,{5pAe^^)yj 

(A. 13) 


which clearly vanishes for even n. Inserting (A.12) and (A.13) into (A.11) and performing 
the sums gives 

5^Ra = {e^^5pA[f{K) - f{K)]5pAe ^^), (A. 14) 

where / is the smooth function defined in (3.3). Note that here (and in previous expressions 
involving correlators) the operators Ha, SpA should be understood as tensor products with 
the identity operator on A'^. We computed the above sums using (A.3), (A.5) and the trick 



Our manipulations of the infinite sums (A.3) and (A. 15) are somewhat formal since 
each sum converges only for y G (—27r, 27r) while the eigenvalues of Ha are unbounded. 
Nevertheless, since both the initial and final expressions involve only analytic functions of 
Ha, we expect that with appropriate care the final result can be justified for general Ha- 
The representation of log p used in e.g. appendix C of [5] may be useful for this purpose. 
Note that since K is Hermitian and 

m - fiy) = > Q 

for real y, the eigenvalues of f{K) — f'{K) are real and positive. This fact implies 6^Ra > 0, 
with equality if and only if 6pAe^^ annihilates the vacuum. But since the restriction of 
|0) to A is non-degenerate, this can occur only if SpA = 0. Thus (A. 14) is consistent with 
positivity of the relative entropy. 
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